arXiv: 1509.04805v2 [cs.IT] 13 Jan 2016 


1 


Energy-Efficient Cell Activation, User Association, 
and Spectrum Allocation in Heterogeneous 

Networks 

Binnan Zhuang, Dongning Guo, and Michael L. Honig 


Abstract —Next generation (5G) cellular networks are expected 
to be snpported by an extensive infrastructure with many-fold 
increase in the nnmber of cells per unit area compared to today. 
The total energy consnmption of base transceiver stations (BTSs) 
is an important issue for both economic and environmental 
reasons. In this paper, an optimization-based framework is 
proposed for energy-efficient global radio resource management 
in heterogeneous wireless networks. Specifically, with stochastic 
arrivals of known rates intended for users, the smallest set 
of BTSs is activated with jointly optimized user association 
and spectrum allocation to stabilize the network first and then 
minimize the delay. The scheme can be carried out periodically on 
a relatively slow timescale to adapt to aggregate traffic variations 
and average channel conditions. Numerical results show that the 
proposed scheme significantly reduces the energy consumption 
and increases the quality of service compared to existing schemes 
in the literatnre. 

I. Introduction 

Commercial wireless networks are evolving towards higher 
frequency reuse by deploying smaller cells to meet increasing 
demand for mobile data services. In a heterogeneous network 
(HetNet), macro cells provide for wide area coverage and 
for serving highly mobile users, whereas dense deployment 
of femto, pico, and/or micro cells, possibly with distributed 
antennas, can support much higher data rates per unit area. 

The energy consumption due to information and commu¬ 
nication technologies worldwide is rising rapidly Q. As the 
number of base transceiver stations (BTSs) increases, it is ever 
more important to manage their energy consumption. An active 
macro BTS consumes 40 to 80 watts on transmission, whereas 
the total power consumption is typically hundreds to well over 
1,000 watts, which includes the power for signal processing, 
computation, cooling, and radio frequency power amplification 
(see, e.g., and references therein). Hence, savings from 

transmit power control alone are relatively limited. Much more 
significant power savings can be accomplished by turning a 
BTS off or switching it to deep sleep mode. 

In this paper, we study how to support given traffic with as 
few active small cells as possible to conserve energy in a Het¬ 
Net. Because it may take many seconds to reactivate a BTS in 
deep sleep Q, the on-off decision should rely on the aggregate 
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traffic and average channels conditions collected over a period, 
possibly lasting a minute or more. Cell activation/deactivation 
thus occurs on a much slower timescale than channel-aware 
scheduling, which typically occurs over time slots of a few 
milliseconds ||8]. The central problem considered here is how 
to jointly optimize spectrum allocation and user association to 
minimize the number of active cells. Without loss of generality, 
we focus on downlink data transmissions. 

The problem formulation in this paper builds upon prior 
work 1^. The following two features distinguish this paper 
and |[9| from most existing work on energy-efficient resource 
management in the literature, including p0)-|[^. First, we 
consider stochastic packet arrivals to user traffic queues in 
lieu of static data rate requirements. Stochastic traffic better 
models the challenges for small cells as they have much 
more pronounced traffic variations than macro cells. The 
proposed slow timescale resource management adapts to the 
average traffic load, thus avoiding frequent on/off updates 
due to varying user equipment (UE) data rates. Second, the 
formulation here facilitates the maximum amount of spectrum 
agility. Specifically, by considering all possible reuse patterns, 
arbitrary (possibly nonconsecutive) spectrum can be allocated 
to a link from any BTS to any UE (see also f^, @). This 
is in contrast to the full-spectrum reuse (i.e., all BTSs use all 
available spectrum) assumed in ||l0)-||T9). 

The optimization problem formulated here is a mixed inte¬ 
ger program, where cell activation decisions are expressed as 
binary variables. The problem is solved numerically using an 
iterative algorithm based on reweighted ii minimization p3j . 
The method was interpreted as majorization-minimization 
in | [24l . Reference | [T3| uses the same method to solve the cell 
activation and user association problem under full-spectrum 
reuse and static rate requirements. 

Numerical results show that the proposed resource man¬ 
agement method achieves significant energy savings as well 
as throughput gains in a typical HetNet. In particular, the 
performance advantages of spectrum agility is demonstrated 
by comparing with the method in p3| . As previously noted 
in 0, spectrum agility is crucial for improving network 
throughput. Intuitively, improved spectrum allocation allows 
more BTSs to be turned off. Eurthermore, a silent BTS causes 
no interference to other cells, so that the remaining BTSs 
may attain higher spectral efficiencies | [25) . This may present 
additional opportunities for turning off BTSs and adding more 
energy savings. 

The proposed method offers a different tradeoff between 



2 



Fig. 1. The topology of a HetNet with ni = 2 macro BTSs, n 2 = 10 
randomly deployed pico BTSs and fc = 66 user groups. 


performance and complexity than in HD- The scheme in p3| 
is computationally feasible for a HetNet with hundreds of 
BTSs, whereas the method here is feasible for a cluster of 
at most 20 to 30 BTSs due to a large number of additional 
spectrum allocation variables in the optimization problem. 

The remainder of this paper is organized as follows. The 
system model is introduced in Section [I^ The optimization 
problems are presented in Section m and the algorithms are 
introduced in Section IV Simulation results are presented in 
Section |y) and conclusions are drawn in Section [VT] 


BTSs and k — 66 user groups is shown in Fig. [T] where 
each group of UEs is assumed to be located at the center 
of each hexagon (as in p5)). For ease of characterizing the 
delay as the objective, the aggregate traffic of group j UEs is 
modeled by Poisson traffic arrivals with arrival rate Xj. The 
packet length is exponentially distributed with average length 
L. It is possible to adopt a different traffic and queueing model 
(see, e.g., p^). 

We assume each BTS assigns different spectrum resources 
to different user groups. This can be viewed as statistically 
multiplexing the packet streams from different user groups. 
The packets from the same group of UEs are served according 
to a ‘first in hrst out’ policy. Hence, each user group effectively 
has a virtual queue. We also assume multiple BTSs can serve 
the same group of UEs|^ 

We assume BTS i transmits to its UEs with hxed flat 
transmit PSD, p^. The spectral efficiency of the link i —>■ j 
under pattern A depends on the receive power and the inter¬ 
ference. Eor concreteness in obtaining numerical results, we 
use Shannon’s formula to obtain; 


A - 


Wl{i e A) 

T ’ 


l0g2 1 + 


Pi9 


. 1^3 


Pi'9'' + '^3 


( 2 ) 


H. System Model 

We consider the downlink of a HetNet with n BTSs, 
including ni macro BTSs and n 2 = n — ni pico BTSs. Denote 
the set of macro and pico BTSs as A/j = {I,-- - ,ni} and 
= {rii -I- 1, • • • , n}, respectively. Let Af = {1, ■ • • , n} 
denote the set of all the BTSs. All BTSs operate on the same 
licensed band of W Hz. The frequency resources are assumed 
to be homogenous on a slow timescale. 

The key to total spectrum agility is the notion of pattern 0. 
A pattern over a set of time-frequency resources is a subset of 
transmitters sharing the resources. In the downlink, a pattern A 
is a subset of {1,..., n}, and all BTSs in A have simultaneous 
access to the frequency band associated with the pattern. 
Assuming known transmit power spectral densities (PSDs), 
a particular pattern determines the signal-to-interference-plus- 
noise ratio (SINK) and hence the spectral efficiencies of all 
links in the network. The allocation problem can then be 
formulated as how to divide the resources among all 2" 
patterns. In a simple example with only 2 BTSs, there are only 
patterns: {!}, {2}, {1,2}, and 0, which denote the patterns 
used respectively by BTS 1 and BTS 2 exclusively, the pattern 
used by both BTSs, and the pattern used by none, respectively. 
Eor every pattern A C Af, let ija be the fraction of total 
bandwidth allocated to it. Clearly, 

Y,yA = l, (1) 

AdN 

and any efficient allocation would set 2/0 = 0. 

At the slow timescale considered it is reasonable to treat 
users near each other with similar quality of service (QoS) 
requirements as a group. Denote the set of all user groups 
as /C = {1, • • • , k}. An example HetNet model with n = 12 


in packets/second, where l(i € A) = 1 if i S A and l(i ^ 
A) = 0 otherwise, is the power gain of the link i -A- 
j, and rij is the noise PSD at group j UEs. The link gain 
gi’^j includes pathloss and shadowing effects over the slow 
timescale considered in this paper. Hence is a constant 
in each decision period independent of the frequency. If small 
scale fading is included in on the slow timescale, then 
ergodic spectral efficiency must be used instead of (|^, since 
the decision period spans many coherence time intervals. 

All packets intended for group j UEs arrive at an M/M/1 
queue. Denote the bandwidth allocated to BTS i to serve group 
j UEs under pattern A as . The service rate for this queue 
contributed by BTS i under pattern A is A This rate 

is guaranteed regardless of the activities of the other BTSs and 
queues. The total service rate for queue j is a linear function 
of the bandwidths: 


Tj = x^^^ packets/second. (3) 

idM AcAf 


Hence the average packet sojourn time of the M/M/1 queue 
for group j UEs takes a simple form m- 


tj = 



seconds. 


(4) 


where (x)+ = max{0,x}, so Q = -foo if Vj < Xj. 


*On a slow timescale this can be realized by letting different BTSs serve 
different subsets of individual UEs. 
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III. Optimization Framework 


A. Baseline Formulation 


The basic energy-efficient resource allocation problem is 
formulated as: 


minimize 

'y ^iZi 

(Pla) 

r,x,v,z 

i^A/2 


subject to 

Zi G {0,1}, Vi G A /2 

(Plb) 


= E E Vj G IC 

iGAf AcAf 

(Pic) 


xE'’< 2 /a, yi G Af, A c Af 

(Pld) 


E E V* G A/j 

AcAf jeK 

(Pie) 


x"a^ >0, yiGAf, j GlC, AcAf 

(Plfj 


rj-Xj>T~^, yjGK, 

(Pig) 


E 1 

AcAf 

(Plb) 


where: 

• The variables r, x, y, and 2 : are the vector forms of 

i^A jGK:, AcAf’ (V^'^AcAT^ (^OigTVs’ 

respectively. 

. ( jPTal l is the total energy cost of the network, where 
Zi constrained by ( |Plb| l is a binary variable indicating 
whether pico BTS i is on or off, and Ci is the power cost 
of pico BTS i if it is active. While the macro BTSs are 
assumed to be always on to provide basic coverage, it is 
easy to include their on-off decisions as variables as well. 

• ( Plc| i is the service rate of each user group as given in ([^. 

• ( Pld| i guarantees that, for every pattern A, the total 
bandwidth a BTS allocates to all groups does not exceed 
the bandwidth assigned to pattern A. 

. dPT^ states that a pico BTS i uses no spectrum if it is 
off and uses at most 1 unit of bandwidth if it is on. 

• ( Plf| l constrains all bandwidths to be nonnegative. 

• ( Plg| l is the QoS constraint for each user group, where 
Tj is the maximum delay allowed for group j UEs. The 
constraint is equivalent to letting tj in Q satisfy 0 < 
Q < Tj. In lieu of QoS, quality of experience (QoE) can 
be considered using the proposed framework as well. To 
achieve this, each user group can be further divided into 
smaller groups representing different rate, delay and other 
QoS preferences. Also, QoE may be modeled a general 
user-dependent utility function Qj{Xj,rj). 

• ( |Plh| l constrains the total system bandwidth to be one 
unit. 


The formulation addresses joint user association, spec¬ 
trum allocation, and BTS activation all under one framework. 


The objective and all the constraints are linear. Solving PI 


minimizes the energy cost by turning off pico BTSs not needed 
to satisfy the delay requirements. 


very small networks. Eortunately, we can use Caratheodory’s 
Theorem to show that there exists an optimal allocation that 
uses no more than k patterns. 


Theorem 1: There exists an optimal solution of PI in which 
y is fc-sparse, i.e.. 


{A\yA>^, AcM} 


< k. 


(5) 


Proof: We first reformulate by defining a new set of 
variables which represent the fraction of spectrum 

under pattern A that BTS i allocates to group j. The con¬ 
straints (|Plc|l-(|PTf|l then become: 


= E E' 


, 0/0, J 


VA, Vj G IC 


AcAf \ieAf 




E 

EE^ 

AdM j£K 


^^3 


yiGff, AcAf 


VA < Zi, Vi G Af2 


Va ^ 


> 0, Vi G Jf, j G IC, A c Af. 


(P2a) 


(P2b) 


(P2c) 


(P2d) 


This problem, P2 where {v\ replaces {x\ is clearly 


equivalent to IPII Solving the actual spectrum allocations 
are given by = UaVa • 

In the remainder of this proof, we show that if a solution 


{r,v,y,z) to P2 exists, then there exists a A:-sparse y* and 
rate tuple r* such that {r*,v,y*,z) is feasible. This attains 
the original objective ( |PTal ) because 2 ; remains the same. We 
shall verify the feasibility constraints pertaining to {r*,y*), 
including ( |Plg| i, ( |Plh| i, ( |P2a| i, and ( |P2c| i. 

Suppose y has more than k nonzero elements. Let its 
support be S (yA = 0 if A ^ S). Let us define a fc-vector 
tA for every A G S with its elements determined by tj A = 




According to ( |P2a| l, a convex combination 
of the vectors (2yi)AG5 with {yA)A&s as coefficients form 
the optimal rate tuple: r = EacW VAtA By Caratheodory’s 
Theorem, r can be represented as a convex combination of 
no more than A: -f 1 of those vectors, denoted as 
Moreover, either r is on the boundary of the convex hull of 
(t^jAGS* or it is an interior point. In either case, there exists 
r* on the boundary that dominates r in every dimension. 
Clearly, r* is the convex combination of at most k vectors 
from (iA)AGS*- Therefore, there exists y* satisfying ( |Plh| ) 
with support S** C S* C S, such that |S**| < k and 

r-* = E y*A*A (6) 

ags** 

which implies ( |P2a| l. Because r* dominates r in every dimen¬ 
sion, ( |Plg| l is satisfied. It remains to show that ( P2c| l holds. Eor 
every i with Zi = 1, ( |P2c| l is satisfied due to ( |Plh| l and ( |P2b| l. 
Eor every i with Zj = 0, ( |P2c[ ) requires that = 0 for every 
j and every A G S with yA > 0, which implies that ( |P2c| ) 
remains true for y*, because its support is dominated by that 
of y. This completes the proof. ■ 


B. Structure of the Optimal Solution 


One concern with PI is that the spectrum is divided into up 
to 2” patterns (segments), which may be impractical for all but 


C. Comparison with Full Spectrum Reuse 

The algorithm to be introduced for solving ICT is related 
to the majorization-minimization approach in |13|, which is 
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based on reweighted minimization proposed in p3| . To fa¬ 
cilitate a fair comparison, we formulate an analogous problem 
to the one in m for cell activation and user association under 
the current framework. With minor changes, the optimization 
problem in 1131 becomes; 


minimize > CiZi (P3a) 

r,x,z 

subject to Zi s {0,1}, Vz G (P3b) 

Tj = ^ Vj G K (P3c) 

ViGAf (P3d) 

ie/c 

>0, Vi G Af, Vj G 1C (P3e) 

rj-Xj>T-\ VjG/C (P3f) 


where all variables have the same physical meaning as in PI 
The key change from to is that only full spectrum reuse 
is allowed, where the spectral efficiency of the link i —j is 
optimizes user association and the pico BTS on/off 


P3 


■ 

selection to minimize the energy cost. The performance of 
allocations based on PI and P3 will be compared in Section [V| 
using numerical examples. 


D. Generalization of the Framework 

The objective ( |Pla| l is equal to the weighted norm of 2 :: 

X! G\zi\o (7) 

ieM2 


where for any real number z, \z\o = 0 if 2 = 0 and = 1 
otherwise. Hence PI admits an equivalent formulation with 
the objective function changed to Q and with ( |Plb[ ) relaxed 
to G [0,1], i G A^- 

l^can also be generalized to utility functions of the form: 


Ci\zi\o +f{r,x,y,z) (8) 


subject to the same constraints, where z^G [0,1] indicates the 
fraction of the total bandwidth used by BTS i. The objec¬ 
tive (HJ is highly versatile. For example, letting /(r, x, y, z) = 
SiGA/ accounts for the transmit power consumption being 
proportional to the bandwidth allocation; letting 


f{r,x,y,z) = PYI 

jeK 


A, 1 

fj — Xj 


(9) 


incorporates the cost of delay in the objective, where j3 
accounts for the tradeoff between energy and delay. 


A. Algorithm Based on Reweighted Minimization 


The basic algorithm for solving PI consists of iterating 
between solving a convex optimization problem with weighted 
norm relaxation of the objective 0, and updating the 
weights. The continuous convex optimization problem based 
on relaxation is: 


minimize 

r.x,y.z 


iG7V2 


subject to Zi > 0, i G A^ 

(|PTc|)-(|PTh| 


(P4a) 

(P4b) 


where the objective function becomes a weighted £i norm of 
z in lieu of the weighted fg norm. 


Algorithm 1 Reweighted £i minimization. 


INPUT; 




A 


J j^jc, agM 

OUTPUT: (r, x, y, z). 

Initialization: 4 — 1, Vz G A^, 


> and 


U ^ 4— 0, ZZ° 4 — X]iG7y2 

while t < T and 


and f 4 — 1. 
> El do 


1. Compute (r‘, a;*, y*, z*) and the corresponding 


P4 


optimum u by solving 

2. Update the weights 

3. t i — f -f 1. 

end while 


with {wl)i 


GA /2 

ny wl'^^ 4 — ^ 




as the weights. 

Vz G A^- 


Algorithm states the iterative procedure. It starts with the 
weights m as a vector of all ones. In the tth iteration, the 
algorithm hrst computes (r‘, a;*, y*, z‘) by solving ( |P4| l with 
lu* as the weights. In fact, this is a simple linear program. 
The weights are then updated as 4— l/(z| -f £ 2 ), where 
£2 is some small number. If £2 = 0, the weight Wi is the 
inverse of Zi. Hence WiZi is a good approximation of |zi|o. 
We can also view Wi as a penalty term. If z‘ is large, then 
« 0, whereas if z| is small, then ^ 1, so that 

Zi is more likely to be driven to 0 in the following iteration. 
The algorithm terminates if either the maximum number of 
iterations T is reached or convergence (according to some 
predehned threshold £ 1 ) is achieved]^ 

An alternative explanation of Algorithm based on the 
majorization-minimization approach is given in fl?) . It makes 
use of the following property of the £q norm |23|, | |24) : 

|z|o = lirn5e(z) (10) 


for z > 0, where 


IV. Reweighted £i Minimization 

With the weighted £q norm Q as its objective, PI is a mixed 
integer program. It is generally difficult to solve due to its 
combinatorial nature. Such optimization problems frequently 
appear in sparse signal recovery, portfolio optimization, and 
statistical estimation. In this paper, we present an algorithm 
based on a low-complexity method, called reweighted £\ 
minimization 


9e{z) = 


l0g(l-|-Z£ 

l0g(l -f £-1) ■ 


( 11 ) 


The majorization-minimization method can be regarded as 
solving a sequence of minimization problems, where in each 
instance it minimizes a surrogate function that locally ma¬ 
jorizes the true objective function (see |24|, p8) for more 

^Here, convergence refers to the objective not necessarily the variables. 
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details). For a concave and differentiable function /, a simple 
and effective majorization-minimization update is: 

= argmin/(z‘) + A/(z‘)^(z — z‘), (12) 

zee 


where is the feasible set of the problem. Since for every e > 
0, is concave on [0,(X)), substituting 0 and ([^ into 
we obtain: 


z*"*"^ = argmin -< 

2GC Zi + €2 


(13) 


where C is the feasible set o f |P4| It is easy to see that ( [T3| ) 
is exactly what Algorithm computes in each iteration. 
Hence Algorithm [T] can be viewed as using the majorization- 
minimization method to minimize the approximate objective: 
/(^) = J2ie^^2^^9e2{z^). The concavity of /(z) and ([0 
imply /(z‘+^) < which establishes the convergence 

of Algorithmic 


B. A Refined Algorithm 

Algorithm [C mainly deals with the combinatorial nature 
of ICC introduced by the binary variables z. However, the 
number of variables in P4 is 0(fcn2") due to the 2" frequency 
patterns. We next reduce the complexity and improve the 
convergence speed somewhat using the fact that switching 
each BTS off halves the number of available patterns. 

There are two important properties of |P4| an d Algorithm |C 
First, if Zi = 0, according to ( |Pld| i and ( |Ple| i, then i/a = 0 
for every A that contains i. Second, if £2 is close to 0, once 
Algorithm 1 decides to switch a certain BTS off in some 
iteration, it is unlikely to be reactivated in later iterations. This 
is because the weight £ 2 “^ corresponding to Zi = 0 is typically 
much larger than that corresponding to any other nonzero 
element. Based on these two properties, a refined version of 
Algorithm 1C is proposed in Algorithm |C which reduces the 
dimensions of the feasible set during the iterations. 


Algorithm 2 A refined algorithm. 




'A I 


i&N, jelc, AcAf 


, and (ci)igA/-,. 


INPUT: 

OUTPUT: (r, a;, y, z). 

Initialization: ^ 1, Vi G 

u~^ ^0, u° ^ LiGA/'s ^ ^ 1- 

while t <T and \vf~^ — > £1 do 

1. Compute (r‘, a;*, z‘) and the corresponding 

optimum u* by solving 


P4 


with (wDzga ^2 weights. 


„*+i 


2. Update the weights by w' 
if 3i G A /2 s.t. z| = 0, and J2i-. 
A /2 ^ — {i I z/ > 0, i G A/*2} 


z*-|-e 2 ■ 


then 


A/" A/j U A^ 

end if 

3. t i — f -f 1. 

end while 


The only change from Algorithm |C is that the BTSs turned 
off at the end of each iteration are eliminated from future 
optimizations if a certain condition is met. Namely, if some 
Zi becomes zero in any iteration, we then compare the sum 


of the penalties on all nonzero components, J2i-z 
against the penalty on any zero component up to a scale factor, 
namely, a/€ 2 - If the total penalty on the nonzero terms is 
smaller, the pico BTSs with zero Zi will be ignored in future 
iterations, i.e., future optimizations of|PCwill be considered 
for the reduced pico BTS set A 2 = {* | z^ > 0, i G A/ 2 }. 
Analogous to Algorithm |C Algorithm |C can be regarded as 
updating a sequence of objective functions /* in the same 
form, but with A /2 updated in each iteration. With slight abuse 
of notationjCwe have /*(z:*) = /^(z‘). It is also easy to show 
that < /‘+^(z*) = /‘(z:*) < /*(z‘“^). Hence the 

values of the approximate objective after each iteration 
form a monotonically decreasing sequence, which establishes 
the convergence of Algorithm]^ 

To avoid a premature reduction of the feasible set, the 
weights on active and inactive BTSs are compared in Algo¬ 
rithm^ When J2i- < a/€ 2 , it is easy to see that 

an ofraTS in the current iteration will not be turned back on 
in the next iteration according to Algorithmic This is because 
the total reduction of the objective achieved by turning off all 
currently active BTSs in the next iteration is not enough to 
compensate for the increased cost due to reactivating a single 
currently inactive BTS. By setting the scale factor a small 
enough, the off BTSs that are eliminated from the feasible 
set are unlikely (albeit still possible) to be turned on again 
according to Algorithm |C If I pico BTSs are removed from 
A/ 2 , the number of variables is reduced by a factor of 2\ 
which results in a 2* complexity reduction in future iterations. 
The performance of Algorithms|C and |C will be compared in 
Section |V] Algorithms |C and |2]can be used to solve |^ if 
the function / in ( |P4a[ l is convex. The only difference then is 
that the continuous optimization problem in each iteration is 
a convex optimization problem instead of a linear program. 


C. Post Processing 


The post processing in | |T3| rounds up the continuous relax¬ 
ation of user associations to binary associations. This is not 
an issue in the model considered here since user association 
over a slow timescale is indirectly determined by the amount 
of resources allocated to each group. However, a different 
post processing can be used to improve the delay performance 
without increasing the energy cost, since the objective ( |PTal ) 
only depends on the binary variables z. 

Here post processing is executed after a solution of |^ 
is obtained. The process is performed over the subset of 
active BTSs, with all off BTSs removed from the formulation. 
Specifically, the post processing is to solve the following 
problem (cf 0): 


E Xj 1 

=—-—— •-— 

jgK L/gK 

subject to ( |Plc| i- dPTiil i 


(P5a) 


where objective ( |P5a| l is the average packet sojourn time in 
the network. 1^ is a convex program and is much easier to 
solve than IPlI 


corresponds to the (possibly reduced) set A/ 2 - Here is evaluated 

by setting any 2 ^ not in the original A ^2 to zero. 
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TABLE I 

Parameter coneigurations. 


Parameter 

Value/Function 


macro transmit power 

46 dBm 


pico transmit power 

30 dBm 

8 

total bandwidth 

10 MHz 


average packet length 

0.5 Mb 


macro to UE pathloss 

128.1-1-37.6 logip(i(;) 

1 ® 

pico to UE pathloss 

140.7-1-36.7 login (iJ) 

CD 

> 


We shall see in Section |V] that post processing can greatly 
improve the delay performance in the light traffic regime. 
The post processing is helpful if the objective only depends 
on the fo norm. We can also consider the generalizations in 
Section |III-D to minimize the total combined cost of energy 
and delay. 



- solving P2 using Algorithm 1 

- solving P1 using linear integer solver 

■ solving P1 using Algortihm 1 

■ solving P1 using Algortihm 2 



1 1.5 2 2.5 3 3.5 4 

average packet arrival rate per user type (packets/second) 


V. Numerical Results 

In this section, we demonstrate the effectiveness of the 
proposed method through numerical examples. In particular, 
we compare the performance of Algorithms and 1^ for 
solving as well as the performance of Algorithm m for 
solving [P^ which basically corresponds to the scheme of||13). 

A. Simulation Setup 

The simulation is carried out over the network in Fig. 
The HetNet is deployed on a 500 x 1000 m^ area. The entire 
geographic area is divided into a hexagonal grid with 66 
hexagons. Each hexagon represents a user group. The UEs 
within each hexagon/user group are assumed to be at the center 
of the hexagon. The BTSs are assumed to be at the vertices 
of the hexagons. There are two macro BTSs in the HetNet 
denoted by the dark squares. Ten pico BTSs are randomly 
placed in the network denoted by the triangles. 

The spectral efficiency is calculated by (|^ with a 30 dB 
cap on the receive SINR (i.e., an SINR greater than 30 dB 
is regarded as 30 dB). The pathloss models used for macro 
and pico BTSs are the urban macro (UMa) and urban micro 
(UMi) models specified in p^ . Other parameters used in the 
simulation are given in Table^ which are also compliant with 
the LTE standard | [29| . In the simulation, we only consider 
pathloss without slow and fast fading. 

B. Energy-Efficient Spectrum Allocation 

Fig. 1^ illustrates the energy cost due to different alloca¬ 
tion schemes at different traffic intensities. In the simula¬ 
tion, a random vector [oi,--- ,ak] is first generated with 
E[ai] = 1, i = I,-- - ,k. Given the average arrival rate A, 
let A = A[ai, • • • , a^], i.e., the arrival rates of all user groups 
are scaled proportionally with A. The delay requirement for all 
UEs is 0.5 seconds, i.e., Tj = 0.5, Vj G 1C. The energy costs 
of all pico BTSs are set at one unit, i.e., Ci = 1, Vz G A^. 
Hence the y axis also indicates the number of active pico 
BTSs. The curve marked by circles is obtained by solving [P^ 
using Algorithm which can be interpreted as applying the 
scheme of | [T3| to the scenarios presented in this paper. The 
two curves marked by squares and triangles are achieved by 


Fig. 2. Comparison of energy costs of different allocation schemes. 


solving lousing Algorithms [T] and respectively. These two 
curves coincide because those two algorithms yield the same 
energy savings in this case. The curve without any marker 
is achieved by solving using a standard integer program 
solver (to be specified shortly). 

According to Pig. the solution to PI greatly outperforms 
the solution to The solution to P3 can only support up to 
1.4 packets/second per user group. In contrast, the proposed 
scheme here can serve up to 4.3 packets/second per user group, 
a three-fold throughput gain. 

The results obtained using Algorithms and to solve PI 


are very close to the solution obtained using a standard integer 
program solver. (We observe that at most one extra pico 
BTS is turned on in all traffic regimes.) Similar results are 
observed with random realizations of the network and traffic 
distribution. 

The spectrum allocation and user association according to 
the solution to PI are shown in Pig.|^ Each pie chart indicates 
the spectrum allocation at the corresponding user group. The 
different colors represent different patterns, and the amount of 
spectrum resources allocated to each group under each pattern 
is denoted by the size of the corresponding pie. The average 
packet arrival rate of each group is shown by the number above 
the pie chart. Each line segment joining a BTS and a group 
means the group is served by the BTS. The color bars on top of 
each hgure shows the actual spectrum partition into different 
patterns. 

In both Eig. 3a and Eig.[^ the spectrum resources allocated 
to each user group is roughly proportional to the corresponding 
traffic demand. The light traffic scenario is shown in Eig. 

All the pico BTSs are turned off, leaving the 2 macro BTSs to 
serve all the user groups. The spectrum in Eig. 3a is divided 
into two segments each exclusively used by one of the macro 
BTSs. Apparently, the allocation is suboptimal in terms of user 
association, since both macro BTSs intrude into the other cell 
to serve some user groups that should obviously be served 


by the other macro BTS. This is because the objective in PI 


is only to minimize the number of active pico BTSs. The 
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(a) average packet arrival rate = 0.5 packets/second (b) average packet arrival rate = 4.3 packets/second 

Fig. 3. Spectrum allocation and user association before post processing in light and heavy traffic regimes. 



(a) average packet arrival rate = 0.5 packets/second (b) average packet arrival rate = 4.3 packets/second 

Fig. 4. Spectrum allocation and user association after post processing in light and heavy traffic regimes. 


algorithm terminates as soon as a feasible spectrum allocation 
is found that satisfies the user delay requirements using the 
minimum number of pico BTSs. The heavy traffic scenario is 
shown in Fig. 3b The spectrum allocation is topology aware. 
Each pico BTS serves nearby user groups. Macro BTSs then 
serve the user groups in coverage holes of the pico BTSs. The 
two pico BTSs turned off are near clusters of other pico BTSs. 
The spectrum is orthogonalized among nearby user groups, 
and is efficiently reused by user groups that are far away. 

The spectrum allocations in the light and heavy traffic 
regimes after the post processing for further delay improve- 

The 
The 


ment, as described in Section IV-C are shown in Fig. B 
allocation in the light traffic regime is shown in Fig. [4a 
spectrum is still divided into 2 segments. However, instead of 
assigning each segment to each BTS exclusively, one segment 
is shared by both BTSs and the other is exclusively used 
by the macro BTS on the left. The post processing achieves 


fractional frequency reuse, i.e., full frequency reuse among 
all user groups in cell centers, whereas user groups at cell 
edges are served with the spectrum exclusively allocated to 
the left macro BTS. The reason that cell edge users in the 
right macro cell are also served by the left macro BTS is due 
to the relatively small network size. 


The spectrum allocation after post processing shown in 
Fig. 4b is similar to that in Fig. 3b This is because as the traffic 
gets close to the maximum throughput, there is little margin 
to further reduce the delay once the delay requirements are 
met. The post processing reduces the average packet sojourn 
time in the network from 0.49 seconds to 0.29 seconds in the 
light traffic regime. However, it only reduces the delay from 
0.5 seconds to 0.45 seconds in the heavy traffic regime. 































































Fig. 5. Runtime comparison at different traffic intensities. 

TABLE II 

Number of iterations using Algorithm[T]and Algorithm[2] 


all UE groups and the spectral efficiency of all links (i.e., 
A and s) in order to perform the proposed energy-efficient 
global resource management. Location information can help 
to identify users from different groups, which can be acquired 
using standard positioning schemes. The traffic information 
for each UE group can be measured at its serving BTS. The 
link gains are routinely measured by the BTSs. If BTS i does 
not receive signals from UE group j, then the gain of link 
i > j can be regarded as zero. 

On the slow timescale considered, the overhead of storing 
and forwarding the aforementioned information over backhaul 
links is quite small. Eor example, to describe 10,000 param¬ 
eters (32 bits each) once every minute translates to about 5 
kilobits per second (kbps). The decision variables need to be 
fed back to each BTS after solving the optimization problem. 
The number of variables sent to each BTS is at most O(fc^). 
Even with a million variables per minute, the overhead is 
merely 500 kbps. 

VI. Conclusion 


arrival rate (packets/second) 

0.48 

1.43 

2.39 

3.34 

4.30 

iterations for Algorithm ] 


2 

2 

13 

57 

102 

iterations for Algorithm 2 


2 

2 

9 

6 

13 


C. Runtime Considerations 

The runtimes of the four schemes are shown in Eig. The 
curves are marked in the same way as in Eig. The optimiza¬ 
tion problems were solved in Matlab using CVX from CVX 
Research, Inc. on an Intel Core i7 2.4 GHz quad-core computer 


with 16 GB RAM. The continuous linear program P4 in the 
iterations of Algorithms [T] and was solved by the default 
linear program solver of Gurobi p0[ . The parameters in 
Algorithms and are T = 200, ei = £2 = 10“® and 
a = 0.1. The curve without marker was solved by the default 
linear integer program solver of Gurobi. 


Although the solution to P3 is suboptimal in both energy 
savings and network throughput, it can be solved much faster 
than the other three, and can be applied to large systems 
with hundreds of BTSs GD- Among the remaining three 
algorithms. Algorithm |^s runtime is the most favorable under 
all traffic conditions. 

The number of iterations when using Algorithms [T] and is 
shown in Table [I^ In the light traffic regime, both algorithms 
converge within two iterations. In fact, both algorithms imme¬ 
diately realize that all pico BTSs should be turned off under 
such light loads. Hence the reduction of the feasible set in 
Algorithm 1^ never happens. In the moderate and heavy traffic 
regimes. Algorithm]^ takes fewer iterations due to the feasible 
set reduction. The runtime using Algorithm]^ is reduced by as 
much as 42 times compared to Algorithm [T] as shown in Fig. 
The shorter runtime is due to both faster convergence and 
lower computational complexity in each iteration with reduced 
dimensions. 


D. Overhead 

A central controller needs to know the traffic intensity of 


Traffic-driven radio resource management on a slow 
timescale has been proposed and studied for improving en¬ 
ergy efficiencies in HetNets. Joint spectrum allocation, user 
association, and cell activation significantly reduce energy 
cost and improve system throughput. The proposed algorithms 
can efficiently optimize a network cluster with up to 20 
BTSs. The improved performance is at the cost of increased 
computational complexity for exploiting spectrum agility. 

The main drawback of the current problem formulation is 
that the complexity scales exponentially with network size 
due to the combinatorial patterns. In practice, many of the 
2" patterns in a large HetNet can be easily ruled out, i.e., two 
BTSs far apart will not interfere with each other, and a UE 
will not be served by a BTS far away. Extending this approach 
to much larger HetNets is ongoing work. The general resource 
management framework has also been extended to mixed fast 
and slow timescales in preliminary work GD- 
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